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Abstract 

We prove the curse of dimensionality for multivariate integration of C k functions. 
The proofs are based on volume estimates for k = 1 together with smoothing by 
convolution. This allows us to obtain smooth fooling functions for k > 1. 

MSC: 65D30,65Y20,41A63,41A55 
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1 Introduction 

We study multivariate integration for different classes F d of smooth functions /: M d — > M. 
Our emphasis is on large values of d G N. We want to approximate 

Sd(f)= ! f{x)dx for f E Fd (1) 

JDd 

up to some error e > 0, where Dd C M. d has (Lebesgue) measure 1. The results in this paper 
hold for arbitrary sets Dd, the standard example of course is Dd = [0, l] d . 

We consider (deterministic) algorithms that use only function values. We consider classes 
Fd of functions bounded in absolute value by 1 and containing all constant functions f(x) = c 
with |c| < 1. This implies that the initial error is one, i.e., 

inf max \S d (f) - c\ = max \S d (f)\ = 1, 

cm f€F d feF d 

so that multivariate integration is well scaled and that is why we consider e < 1. 

Let n(e, Fd) denote the minimal number of function values needed for this task in the 
worst case setting^]. By the curse of dimensionality we mean that n(e, Fd) is exponentially 
large in d. That is, there are positive numbers c, Eq and 7 such that 

n(e, F d ) > c (1 + 7) d for all e < e and infinitely many d £ N. (2) 

For many natural classes Fd the bound in ([2]) will hold for all d GN. This applies in particular 
to the classes considered in this paper. 

: We add that n(e, Fd) is the information complexity of multivariate integration over Fd and is proportional 
to the (total) complexity as long as F d is convex and symmetric. The last two assumptions are needed to 
guarantee that a linear algorithm is optimal and its implementation cost is linear in n(e,Fd). 
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There are many classes for which the curse of dimensionality has been proved, see [5J [7] 
for such examples. However, it has been not known if the curse of dimensionality occurs for 
probably the most natural class which is the unit ball of r times continuously differentiate 
functions, 

C r d = {feC r (R d )\ \\D^f\\<l for all \(3\ < r}, 

where (3 = (f3 1 , f3 2 , ■ ■ ■ , (3d), with non- negative integers (3j, \(3\ = ^2~ =1 /3j, and D 13 denotes 
the operator of (3j times differentiation with respect to the jth variable for j = 1, 2, . . . d. By 
|| ■ || we mean the sup norm, = sup xgR d \(D^ f)(x)\. 

For r = 0, we obviously have n(e,C d ) = oo for all e < 1 and all d G N. Therefore 
from now on we always assume that r > 1. For r = 1, the curse of dimensionality for C d 
follows from the results of Sukharev [§]. Whether the curse holds for r > 2 has been an open 
problem for many years. 

The class C d for D d = [0, l] d (and functions and norms restricted to D d ) was already 
studied in 1959 by Bakhvalov [2], see also [1]. He proved that there are two positive numbers 
dd, r and Ad t r such that 

a d ,r s~ d/r < n(e, C r d ) < A d , r e- d/T for all d e N and e e (0, 1). (3) 

This means that for a fixed d and for e tending to zero, we know that n(e, C r d ) is of order e~ d l r 
and the exponent of e" 1 grows linearl}H in d. Unfortunately, since the known dependence on d 
in ad,r is exponentially small and the known dependence on d in A d ^ r is exponentially large 
in d, Bakhvalov's result does not allow us to conclude whether the curse of dimensionality 
holds for the class C d . In fact, if we reverse the roles of d and e, and consider a fixed e and d 
tending to infinity, the bound (j3J) on n(e,C d ) is useless. We prove the following result. 

Main Theorem. The curse of dimensionality holds for the classes C d with the super- 
exponential lower bound 

n{e, C r d ) > c r (1 - e) d d ^ 2r+ ^ for all d G N and e G (0, 1), 

where c r G (0, 1] depends only on r. 

We also prove that the curse of dimensionality holds for even smaller classes of func- 
tions F d for which the norms of arbitrary directional derivatives are bounded proportionally 
to l/\fd. 

2 In the language of tractability, this result means that we do not have polynomial tractability but does 
not allow us to conclude the lack of weak tractability. 
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We now discuss how we obtain lower bounds on n(e, Fd) for numerical integration defined 
on convex and symmetric classes Fd- The standard proof technique is to find a fooling 
function / G Fd that vanishes at the points V — {x\, x 2 , ■ ■ ■ ,x n } at which we sample functions 
from Fd, and the integral of / is as large as possible. All algorithms that use function values 
at Xj's must give the same approximation of the integral of / and of the integral of — /. 
That is why the integral of / is a lower bound on the worst case error of all algorithms using 
function values at x^s. If for all choices of X\,x 2 , ...,x n the integral of / is larger than e 
then we know that n(e, Fd) > n. 

We start with the fooling function 

f (x) = min jl, ^= dist(x, for all x G R d , 

where 

n 

1=1 

and Bg(xi) is the ball with center x- t and radius 8yd. The function f is Lipschitz. By a 
suitable smoothing via convolution we construct a fooling function f r G C r d and f r \-p = 0. 



2 Preliminaries 

In this section we precisely define our problem. Let Fd be a class of Lebesgue integrable 
functions / : R d — > R. For / G Fd, we approximate the integral Sd(f), see (pQ), by algorithms 

A n ,d(f) = ( Pn,d(f(xi),f(x 2 ),...,f(x n )), 

where Xj G R d can be chosen adaptively and (f) n ^ '■ R™ — > R is an arbitrary mapping. 
Adaption means that the selection of Xj may depend on the already computed values 
f(xi), f(x 2 ), • • • , f(xj^i). The (worst case) error of the algorithm A n>d is defined as 

e(A»,d) = sup \S d (f) - A n4 (f)\. 

fdF d 

The minimal number of function values to guarantee that the error is at most e is defined as 

n(e,F d ) = min{ n G N | 3 A n ^ d such that e(A n ^ d ) < e}. 

Hence we minimize n over all choices of adaptive sample points Xj and mappings <fi n ,d- It 
is well known that as long as the class F d is convex and symmetric we may restrict the 
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minimization of n by considering only nonadaptive choices of Xj and linear mappings 4> n ,d- 
Furthermore, 

n(e,F d ) = mm\n G N | inf sup \S d (f)\ < A (4) 

I -p cR d i#v=n feFd j\ p=0 ) 

see [H Prop. 1.2.6] or [9, Theorem 5.5.1]. In this paper we always consider convex and 
symmetric F d so that we can use the last formula for n(e,F d ). For more details see, e.g., 
Chapter 4 in [5]. 

Observe that we allow Xj G M. d instead of only Xj G D d . In this paper we are interested 
in lower bounds and this assumption makes our results even stronger. 

As already mentioned, our lower bounds are based on a volume estimate of a neighbor- 
hood of certain sets in ~R d , see also pj. In the following we denote by Ag, for A C M. d , the 
(5v / rf)-neighborhood of A, which is defined by 

A s = {x G R d | dist(x, A) < 5\Q}, (5) 

where dist(x, A) = inf ae ^ \\x — a, 1 1 2 denotes the Euclidean distance of x from A. 

Since we need the A/rf-scaling of the distance, we will omit it in the notation as we already 
did for A$. Furthermore, we denote by B d (x) the <i-dimensional ball with center x G M. d and 
radius 5\fd, i.e., 

B$(x) = {y G R d | ||ac - y\\ 2 < 5Vd}. 

We will need some standard volume estimates for Euclidean balls. Recall that the volume 
of a Euclidean ball of radius 1 is given by 



d r(i + d/2)" 

From Stirling's formula for the T function we have 

T(x + 1) = V2ttx x x e~ x+ ^ for all x > 0, 
where 6 X G (0, 1), see [U p. 257]. This leads to the estimate 

T(x + 1) > V2t^x {^j X for all x > 0. 
Combining this estimate with the volume formula for the ball, we obtain for all d G N, 

\ d {B d (x)) = (6Vd) d V d < (5vQ) d ^Z = l^jSL < (sV2^Y. (6) 
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The volume formula for the Euclidean unit ball also shows the recurrence relation 



V d d-l v d _ 2 

This easily implies 



for all d > 4. 



— = — — < . — — for all d > 4. 

V d 

The last inequality can be used in an inductive argument leading to 

2 Vd_i 



This will be needed later. 



< 1 for all d > 2. (7) 



3 Convolution 

In this section we fix k G N and study the convolution 

fk ■= f * 9i* ■ ■ ■* 9k 

of a function / defined on R d with (normalized) indicator functions gj. We are interested 
in properties of in terms of the properties of the initial function /. Recall that the 
convolution of two functions / and g on R d is defined by 

(f*g)(x)= f f{x-t)g(t)dt for all x eR d . 

Fix a number 5 > and a sequence (aj)j =1 with ctj > such that 

k 

5><1. 

i=i 

For example, we may take ctj — 1/k for j — 1, 2 . . . , k. For j = 1, we define the ball 

Bj = {x eR d \ \\x\\ 2 < aijSVd} 
and the function gj : R d — > R by 

= ^ = ^- J 1 if " GB - (8) 
Xd(Bj) ^d(Bj) 1 otherwise. 



Thus, the convolution of a function / with gj can be written as 

{f * 9j){ x ) — T~7~n\ f ffr + t)dt for all x ER d . 

We will frequently use the following probabilistic interpretation. Let Yj be a random variable 
that is uniformly distributed on B y Then the convolution of / with can be written as 
the expected value 

(f* 9j )(x) = Elfix + Yj)]. 

The next theorem is the basis for the induction steps of the proofs of our main results. 
For / : W d — > R, we use the Lipschitz constant 

j ■ m \f(x)-f(v)\ 
Lip(/) = sup — j: n . 

x^y \\X — y\\2 

Define 

C r = {/: R d R | D 0e ... D 9l f is continuous for all t < r and all X , . . . , 9 r E S^ 1 }, 

where S^" 1 is the unit sphere in M. d and D 9l f(x) = lim^o f (/(a? + — /(a;)) is the 
derivative in the direction of Q\. 

Theorem 1. For fceN and f E C r , define 

fk = f * 9i* ■ ■ ■* 9k with #fc from (jEJ). 

For d > 2, let fi C R d and let ftj be its neighborhood defined as in (151). Then 

(i) if /(x) = for all x E Qs then fk{x) = for all x E Q, 

(ii) Lip(/ fc ) < Lip(/), 

(m) if J D /(x + t) dx > e for all t G M d with ||t|| 2 < then J" D f k (x)dx > e, 
(iv) for all £ < r and all 9 U 6 2 , . . . , r G 

Lip (l>^ D^- 1 . . . D 6 * 1 / fc ) < Lip (V f D 8 '- 1 . . . l/ 1 /) , 

(v) f k E C r+k , and for all i < r, all j = 1, . . . , k and all X , 2 , . . . , G S*" 1 , 
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The parts (i)-(iv) of this theorem show that some properties of the initial function / are 
preserved by convolutions. Part (v) states that we gain one "degree of smoothness" with 
every convolution, loosing only a multiplicative constant for its Lipschitz constant. 

Proof. First note that we can write f k as 

f k (x) = E[f(x + Y)] , for all x E R d , 

where Y is a random variable with probability density function gi* ■ ■ ■* g k - By construction 
of g k s which are the indicator functions of the balls whose sum of the radii is at most 5\/d, 
we have 

{t e R d | 0i * ... * g k (t) > 0} c {t e R d \ \\t\\ 2 < dVd}, 

which implies that x + Y E £1$ almost surely for every x E fi. Thus, f(x) = for all x EVt§ 
implies that f k (x) = for all x E Q, which is property (i). 
Property (ii) is proven by 

\fk(x) - f k {y)\ = \K[fk(x + Y)-f k (y + Y)] \ < E[\f(x + Y)-f(y + Y 
< Lip(/)E[||(x + y)-(y + y)|| 2 ] = Lip(/)||x-y|| 2 . 

To prove (in), we use Fubini's theorem and we obtain 



/ f k (x)dx= [ E[f(x + Y)] dx = l[ / f(x + Y)dx 

JD d JD d k/Z) d 



> € 



by assumption. 

For the proof of properties (iv) and (v), let 9 — (9i, . . . , 6i) E (S^ 1 ) 1 . We write D e for 
... D e K Clearly, / E C r and I < r implies that D e f E C r ^ C C. Since f k is at least as 

smooth as /, both D 6 f and D 6 f k are well defined. 

We need the well-known fact that D e (f * g) = (D e f) * g if / E C e and g has compact 

support. For g — g\ * . . . * g k , we have 

\D e f k {x) - D e f k {y)\ = \((D e f)*g)(x)-((D e f)*g)(y)\ 

[ [(D e f(x + t)-D e f(y + t)} g(t)dt 

< Up(D e f) \\x-y\\ 2 [ g(t)dt 



for all x,y E 



= Up(D e f) \\x-y\\ 2 
The last equality follows since the g k is normalized. This proves (in) 
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For (v), we need to prove that f k G C r+k with f Q = f G C r by assumption, and then it 
is enough to show that for all m < r + k and all 9 = (8 m , . . . ,9\) G (S^ 1 )" 1 , 



where = (0 m _i, . . . , 0i) G (S^ 1 )™- 1 . 

Assume inductively that G C m_1 , which holds for = 1. This implies D e (f k _i*g k ) = 
(Lflfk-i) *9k, and 



D e f k (x) = D e -((D S f k ^)*g k )(x) 

= D HtTr\I D S fk-i(x + t)l Bi (t)dt) 



= t— tttt D 6m ( [ I D s f k ^(x + s + he m )i Bk ( s + he m )dhds) 

= / D ° m ( I DS fk-i(x + s + h6 m ) l Bk (s + h6 m ) dh) ds, 

where 8^ is the hyperplane orthogonal to m . For any function / on I of the form 

rx+a 

f(x) = / g(y)dy 

J x—a 

with some continuous function g we have 

f'(x) = g(x + a) - g(x - a). 

Therefore, we obtain 

D°fk(x) = y^ttt / D^f k _ 1 (x + s + h max (s)e m ) 

- &h-\ (x + s- h max (s) 6„}j ds 

with 

^max(s) = max{/i > I s + hd m G B k }. 
For each s G B k (~) 9^, define the points s± = s + h max (s) 9 m G B k and 
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s 2 = s - h max (s) 9 m e B k . Then 



\D e f k (x)-D e f k (y)\ < 



< 



^d(B k 



+ 



D~ 9 f k ^{x + Sl ) -D s f k . 1 (x+s 2 ) 
D 9 f k ^{y + Sl ) + D°f k _ 1 {y + s 2 ) 
D°f k _ 1 (x + s 1 ) -D S f k ^(y + Sl ) 
D S f k ^(x + s 2 ) -D d f k ^{y + s 2 ) 



ds 



ds 



In particular, this shows the implication 

Jk-1 t t/ 



, 2Ad_i(5 fc n^) . ... ,. 
< : — : Lip(-D fk-i) \\x-y\\ 2 . 



fk e C r < 



for all fcsN. Taking m = r + k we have G C r+k , as claimed. 

For m < r + it remains to bound 2\d~\(B k 9^)/ Xd(B k ). Recall that B k is a ball with 
radius 5a k \/d and that Vd is the volume of the Euclidean unit ball in W 1 . We obtain from 
© that 

2A„_i(S fc nO _ 2(5a fc v / 5) d - 1 ^-i _ 2 



^d(B k ) 



(5a k \/d) d Vd 5a k \fd Vd 



V d -! < _1_ 



□ 



4 Main Results 

Let P = {xi, . . . ,x n } C M d be a collection of n points. As pointed out in the introduction, 
we want to construct functions that vanish at V and have a large integral. For this, we 
choose 

fo(x) = min jl, dist(x, for all x E M d , 



where 



V S = \jB d s ( Xl ) 



i=i 



and Bf(xi) is the ball with center X{ and radius 5yfd. 
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The function dist(-,Vg) is Lipschitz with constant 1. Hence for S < 1 

Lip(/o) = (9) 

Additionally, fo(x) = for all x £ Vs by definition. 

Using these facts we can apply Theorem [I] to prove the curse of dimensionality for the 
following class of functions that are defined on R d . For a fixed r £ N, we now take ai = 
• • • = a r — - and define 



d,r,8 



{f:R d f eC r satisfies JIDJ-^E}}, 



where 



< 1, (10) 

Lip(/) * 

V^r^JJJ^,!*^...^/) < ^ (J)*. (12) 
Theorem 2. For any r G N and 5 G (0, 1], 

for d — 1 

n(e,F dir>(5 ) > (1-e) { ~ e , ^ d c , ' for all e £ (0, 1). 

tor a > 2, 




Hence the curse of dimensionality holds for the class Fd }T ,s for 5 < 1/ \/l8e7r. 

Note that this result shows that the growth rate of n(e, F^ r ,s) in d can be arbitrarily 
large if we choose 5 small enough. 

Proof. Since the initial error for the classes F^ r ,s is 1 we obtain n(e, F^r.s) — 1 fo r all 
e £ (0, 1). This proves the statement for d — 1. 

For d > 2, we use Theorem [T] with k = r, Q = V and / r (x) = /o * <7i * ■ ■ ■ * 9r{x)- Here 
(7/-'s are as in Theorem [U Recall that we have chosen «i = . . . = a r = 1/r and <Xj = for 
j > r. The properties of the initial function fa and Theorem [1] immediately imply that f r 
satisfies (|T0|) - (|T2|) . It remains to bound its integral. Note that fo(x) = 1 for all x V25- 
Clearly, f r (x) > for all x £ lR d . Since / r (x) depends only on the values fo(x + t) for t £ M d 
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with || 1 1| 2 < 5\/~d, it follows that f r (x) = 1 for x £ P 3 g. We thus obtain 



f r (x)dx > / f r (x) dx = 1 - X d (V 3S D D d ) 

D d JD d \V- iS 

> 1 - Xd(V 3S ) > 1 - n\ d (Bi & ) 

d 



71 

> 1 



nd 

d 



where the last inequality follows from the bound that is given in fl6]). Hence f D f r (x) dx < e 
implies that 

n> (1-e) (5Vl8en)- d . 
Since this holds for arbitrary V, the result follows. □ 
By Theorem [2], we know how the parameter 5 comes into play For p > 0, let 

1 :d- p ^ +1 \ 



For this S, we obtain a somehow stronger form of the curse of dimensionality for the class 
F d ,r,p = {f:R d ^R\ f EC r satisfies (P]) -(fT51)}. 

where 



< 1, (13) 
Lip(/) < cH + *+i Vl8eTT, (14) 

\/k<r: max Lipf// 1 . . . D 0k f) < d^ +Ei ^ r k (Vl8evr) fc+1 . (15) 

0i,...,0 fe eS d - 1 v ' 

Theorem 3. For any r G N and p > 0, 

n{e, F d ^ p ) > (1-e) d pd ^ r+1 *> for all d G N and e G (0, 1). 
Hence the curse of dimensionality holds for the class Fd, r ,p- 
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Note that the classes F^ r , P are contained in the classes 

C r d = {feC r | H^/ll <1 for all |/3|<r}, 
if p < 1/2 and d is large enough. This holds if 



d > (r r (18enY r+1 ^ 2 ) 1/{1/2 * '. (16) 



From this we easily obtain the main result already stated in the introduction. 

Main Theorem. For any r G N, there exists a constant c r G (0, 1] such that 

n{e, C d ) > c r (1 - e) d d/(2r+3) for all d G N and e G (0, 1). 
Hence the curse of dimensionality holds for the class Q. 

Proof. The case d = 1 is trivial since the initial error for the classes C r d is again 1. 

For d > 2, we know from Theorem [3] and the discussion thereafter that n(e,C d ) > 
(1 — e) d pd /( r+1 ) for all p < 1/2 if d > do, where d = do(r,p) is the right hand side of (TLB"]) . 
This implies that for 



^r,p d| 



-pcfo/(r+l) 



which depends only on r and p, we have 



n(e, CS) > c r , p (1 - e) d pd/(r+1) for all d > 2. 

The choice p* = (r + 1)/ (2r + 3) yields the result with c r = c rjP .. □ 

Note that c r in the last theorem is super exponentially small in r. 

Remark 1. The reader might find it more natural to define classes of functions F d r .(D d ) that 
are defined only on D d C M. d . Not all such functions can be extended to smooth functions 
on R d , and even if they can be extended then the norm of the extended function could be 
much larger. Our lower bound results for functions defined on M. d can be also applied for 
functions defined on D d C W 1 and this makes them even stronger. 

Remark 2. Note that the possibility of super-exponential lower bounds on the complexity 
depends on the definition of the Lipschitz constant. For the class 

/:[0,1]^M| sup L^WM<! 

x,ye[0,l] d \\ X y\\oo 

Sukharev [S] proved that for n = m d the midpoint rule is optimal with error e n = 2^2 n_1 ^° '■ 
Hence, roughly, n(e, F d ) w 2~ d e~ d and the complexity is "only" exponential in d for e < 1/2. 
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Remark 3. We mention two results for the very small class 



Fd = Cf = {f eC°°([0,l} d ) | \\D^f\\ <1 for all f3 G Nq}. 

0. Wojtaszczyk [TU] proved that lim^oo n(e, Fa) = oo for every e < 1, hence the problem 
is not strongly polynomially tractable. It is still open whether the curse of dimensionality 
holds for this class F^. The same class F^ was studied for the approximation problem in [6]. 
For this problem the curse of dimensionality is present even if we allow algorithms that use 
arbitrary linear functionals. 
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